MERS exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data

Internal identifier: 000C59 (Main/Exploration); previous: 000C58; next: 000C60

Authors: Kosai Al-Nakeeb; Thomas Nordahl Petersen; Thomas Sicheritz-Pontén

Source: BMC Bioinformatics, 2017 (eISSN 1471-2105)

RBID : PMC:5699183

French descriptors

English descriptors

Abstract

Background

Whole-genome sequencing (WGS) projects provide short-read nucleotide sequences from nuclear and, depending on the source, organelle DNA. Mitochondrial DNA is present in animals and fungi, while plants contain DNA from both mitochondria and chloroplasts. Current techniques for separating organelle reads from nuclear reads in WGS data require full reference sequences or partial seed sequences for assembly.

Results

Norgal (de Novo ORGAneLle extractor) avoids this requirement by identifying a high-frequency subset of k-mers that are predominantly of mitochondrial origin and performing a de novo assembly on the subset of reads that contain these k-mers. The method was applied to WGS data from a panda, a brown algae seaweed, a butterfly and a filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences ranging from 98.5% to 99.5%. We also assembled the chloroplasts of grape vines and cucumbers using Norgal together with seed-based de novo assemblers.
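
The read-selection idea described above can be illustrated with a minimal Python sketch. This is not Norgal's code: the function names, the cutoff `min_count` and the rule "keep a read if it contains at least one frequent k-mer" are assumptions made for illustration, since the abstract does not state how the frequency threshold is chosen or how reads are matched. The de novo assembly step is left out.

```python
from collections import Counter


def kmers(seq, k):
    """Yield every k-mer (substring of length k) of a read."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def count_kmers(reads, k):
    """Count how often each k-mer occurs across all reads."""
    counts = Counter()
    for read in reads:
        counts.update(kmers(read, k))
    return counts


def select_reads(reads, k, min_count):
    """Keep reads that contain at least one k-mer seen min_count times or more.

    min_count is a hypothetical, user-chosen cutoff standing in for whatever
    depth threshold separates high-copy organelle k-mers from nuclear ones.
    """
    counts = count_kmers(reads, k)
    frequent = {kmer for kmer, n in counts.items() if n >= min_count}
    return [r for r in reads if any(km in frequent for km in kmers(r, k))]


# Toy data: three overlapping reads from a high-copy sequence, two unrelated reads.
reads = [
    "ACGTACGTGA",
    "CGTACGTGAC",
    "GTACGTGACT",
    "TTTTGGGGCC",
    "AAACCCGGTT",
]
selected = select_reads(reads, k=5, min_count=3)
print(selected)
```

On this toy input only the three overlapping reads share k-mers that reach the cutoff, so only they would be passed on to an assembler. Real data would call for a dedicated k-mer counter and a data-driven threshold rather than a fixed number.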

Conclusion

Norgal is a pipeline that can extract and assemble full or partial mitochondrial and chloroplast genomes from WGS short reads without prior knowledge. The program is available at: https://bitbucket.org/kosaidtu/norgal.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-017-1927-y) contains supplementary material, which is available to authorized users.


URL: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5699183
DOI: 10.1186/s12859-017-1927-y
PubMed: 29162031
PubMed Central: 5699183


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data</title>
<author>
<name sortKey="Al Nakeeb, Kosai" sort="Al Nakeeb, Kosai" uniqKey="Al Nakeeb K" first="Kosai" last="Al-Nakeeb">Kosai Al-Nakeeb</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Petersen, Thomas Nordahl" sort="Petersen, Thomas Nordahl" uniqKey="Petersen T" first="Thomas Nordahl" last="Petersen">Thomas Nordahl Petersen</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sicheritz Ponten, Thomas" sort="Sicheritz Ponten, Thomas" uniqKey="Sicheritz Ponten T" first="Thomas" last="Sicheritz-Pontén">Thomas Sicheritz-Pontén</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">29162031</idno>
<idno type="pmc">5699183</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5699183</idno>
<idno type="RBID">PMC:5699183</idno>
<idno type="doi">10.1186/s12859-017-1927-y</idno>
<date when="2017">2017</date>
<idno type="wicri:Area/Pmc/Corpus">000270</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000270</idno>
<idno type="wicri:Area/Pmc/Curation">000270</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000270</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000765</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">000765</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="RBID">pubmed:29162031</idno>
<idno type="wicri:Area/PubMed/Corpus">000A77</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000A77</idno>
<idno type="wicri:Area/PubMed/Curation">000A77</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">000A77</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000B92</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Checkpoint">000B92</idno>
<idno type="wicri:Area/Ncbi/Merge">001C57</idno>
<idno type="wicri:Area/Ncbi/Curation">001C57</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">001C57</idno>
<idno type="wicri:Area/Main/Merge">000C62</idno>
<idno type="wicri:Area/Main/Curation">000C59</idno>
<idno type="wicri:Area/Main/Exploration">000C59</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data</title>
<author>
<name sortKey="Al Nakeeb, Kosai" sort="Al Nakeeb, Kosai" uniqKey="Al Nakeeb K" first="Kosai" last="Al-Nakeeb">Kosai Al-Nakeeb</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Petersen, Thomas Nordahl" sort="Petersen, Thomas Nordahl" uniqKey="Petersen T" first="Thomas Nordahl" last="Petersen">Thomas Nordahl Petersen</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Sicheritz Ponten, Thomas" sort="Sicheritz Ponten, Thomas" uniqKey="Sicheritz Ponten T" first="Thomas" last="Sicheritz-Pontén">Thomas Sicheritz-Pontén</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2017">2017</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Animals</term>
<term>DNA, Chloroplast (genetics)</term>
<term>DNA, Mitochondrial (genetics)</term>
<term>Genome, Chloroplast</term>
<term>Genome, Mitochondrial</term>
<term>Software</term>
<term>Whole Genome Sequencing (methods)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>ADN des chloroplastes (génétique)</term>
<term>ADN mitochondrial (génétique)</term>
<term>Animaux</term>
<term>Génome de chloroplaste</term>
<term>Génome mitochondrial</term>
<term>Logiciel</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="genetics" xml:lang="en">
<term>DNA, Chloroplast</term>
<term>DNA, Mitochondrial</term>
</keywords>
<keywords scheme="MESH" qualifier="génétique" xml:lang="fr">
<term>ADN des chloroplastes</term>
<term>ADN mitochondrial</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Whole Genome Sequencing</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Animals</term>
<term>Genome, Chloroplast</term>
<term>Genome, Mitochondrial</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Animaux</term>
<term>Génome de chloroplaste</term>
<term>Génome mitochondrial</term>
<term>Logiciel</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Whole-genome sequencing (WGS) projects provide short read nucleotide sequences from nuclear and possibly organelle DNA depending on the source of origin. Mitochondrial DNA is present in animals and fungi, while plants contain DNA from both mitochondria and chloroplasts. Current techniques for separating organelle reads from nuclear reads in WGS data require full reference or partial seed sequences for assembling.</p>
</sec>
<sec>
<title>Results</title>
<p>Norgal (de Novo ORGAneLle extractor) avoids this requirement by identifying a high frequency subset of k-mers that are predominantly of mitochondrial origin and performing a de novo assembly on a subset of reads that contains these k-mers. The method was applied to WGS data from a panda, brown algae seaweed, butterfly and filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences in the range from 98.5 to 99.5%. We also assembled the chloroplasts of grape vines and cucumbers using Norgal together with seed-based de novo assemblers.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>Norgal is a pipeline that can extract and assemble full or partial mitochondrial and chloroplast genomes from WGS short reads without prior knowledge. The program is available at:
<ext-link ext-link-type="uri" xlink:href="https://bitbucket.org/kosaidtu/norgal">https://bitbucket.org/kosaidtu/norgal</ext-link>
.</p>
</sec>
<sec>
<title>Electronic supplementary material</title>
<p>The online version of this article (doi:10.1186/s12859-017-1927-y) contains supplementary material, which is available to authorized users.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Bruggen, Efjv" uniqKey="Bruggen E">EFJV Bruggen</name>
</author>
<author>
<name sortKey="Borst, P" uniqKey="Borst P">P Borst</name>
</author>
<author>
<name sortKey="Ruttenberg, Gjcm" uniqKey="Ruttenberg G">GJCM Ruttenberg</name>
</author>
<author>
<name sortKey="Gruber, M" uniqKey="Gruber M">M Gruber</name>
</author>
<author>
<name sortKey="Kroon, Am" uniqKey="Kroon A">AM Kroon</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hahn, C" uniqKey="Hahn C">C Hahn</name>
</author>
<author>
<name sortKey="Bachmann, L" uniqKey="Bachmann L">L Bachmann</name>
</author>
<author>
<name sortKey="Chevreux, B" uniqKey="Chevreux B">B Chevreux</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Dierckxsens, N" uniqKey="Dierckxsens N">N Dierckxsens</name>
</author>
<author>
<name sortKey="Mardulyn, P" uniqKey="Mardulyn P">P Mardulyn</name>
</author>
<author>
<name sortKey="Smits, G" uniqKey="Smits G">G Smits</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Robin, Ed" uniqKey="Robin E">ED Robin</name>
</author>
<author>
<name sortKey="Wong, R" uniqKey="Wong R">R Wong</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Haddad, Nj" uniqKey="Haddad N">NJ Haddad</name>
</author>
<author>
<name sortKey="Al Nakeeb, K" uniqKey="Al Nakeeb K">K Al-Nakeeb</name>
</author>
<author>
<name sortKey="Petersen, B" uniqKey="Petersen B">B Petersen</name>
</author>
<author>
<name sortKey="Dalen, L" uniqKey="Dalen L">L Dalén</name>
</author>
<author>
<name sortKey="Blom, N" uniqKey="Blom N">N Blom</name>
</author>
<author>
<name sortKey="Sicheritz Ponten, T" uniqKey="Sicheritz Ponten T">T Sicheritz-Pontén</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Schubert, M" uniqKey="Schubert M">M Schubert</name>
</author>
<author>
<name sortKey="Lindgreen, S" uniqKey="Lindgreen S">S Lindgreen</name>
</author>
<author>
<name sortKey="Orlando, L" uniqKey="Orlando L">L Orlando</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, D" uniqKey="Li D">D Li</name>
</author>
<author>
<name sortKey="Liu, Cm" uniqKey="Liu C">CM Liu</name>
</author>
<author>
<name sortKey="Luo, R" uniqKey="Luo R">R Luo</name>
</author>
<author>
<name sortKey="Sadakane, K" uniqKey="Sadakane K">K Sadakane</name>
</author>
<author>
<name sortKey="Lam, Tw" uniqKey="Lam T">TW Lam</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, H" uniqKey="Li H">H Li</name>
</author>
<author>
<name sortKey="Durbin, R" uniqKey="Durbin R">R Durbin</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Peng, Y" uniqKey="Peng Y">Y Peng</name>
</author>
<author>
<name sortKey="Leung, Hcm" uniqKey="Leung H">HCM Leung</name>
</author>
<author>
<name sortKey="Yiu, Sm" uniqKey="Yiu S">SM Yiu</name>
</author>
<author>
<name sortKey="Chin, Fyl" uniqKey="Chin F">FYL Chin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kelley, Dr" uniqKey="Kelley D">DR Kelley</name>
</author>
<author>
<name sortKey="Schatz, Mc" uniqKey="Schatz M">MC Schatz</name>
</author>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Camacho, C" uniqKey="Camacho C">C Camacho</name>
</author>
<author>
<name sortKey="Coulouris, G" uniqKey="Coulouris G">G Coulouris</name>
</author>
<author>
<name sortKey="Avagyan, V" uniqKey="Avagyan V">V Avagyan</name>
</author>
<author>
<name sortKey="Ma, N" uniqKey="Ma N">N Ma</name>
</author>
<author>
<name sortKey="Papadopoulos, J" uniqKey="Papadopoulos J">J Papadopoulos</name>
</author>
<author>
<name sortKey="Bealer, K" uniqKey="Bealer K">K Bealer</name>
</author>
<author>
<name sortKey="Madden, Tl" uniqKey="Madden T">TL Madden</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
<author>
<name sortKey="Gish, W" uniqKey="Gish W">W Gish</name>
</author>
<author>
<name sortKey="Miller, W" uniqKey="Miller W">W Miller</name>
</author>
<author>
<name sortKey="Myers, Ew" uniqKey="Myers E">EW Myers</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Aquadro, Cf" uniqKey="Aquadro C">CF Aquadro</name>
</author>
<author>
<name sortKey="Greenberg, Bd" uniqKey="Greenberg B">BD Greenberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ward, Bl" uniqKey="Ward B">BL Ward</name>
</author>
<author>
<name sortKey="Anderson, Rs" uniqKey="Anderson R">RS Anderson</name>
</author>
<author>
<name sortKey="Bendich, Aj" uniqKey="Bendich A">AJ Bendich</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lopez, Jv" uniqKey="Lopez J">JV Lopez</name>
</author>
<author>
<name sortKey="Yuhki, N" uniqKey="Yuhki N">N Yuhki</name>
</author>
<author>
<name sortKey="Masuda, R" uniqKey="Masuda R">R Masuda</name>
</author>
<author>
<name sortKey="Modi, W" uniqKey="Modi W">W Modi</name>
</author>
<author>
<name sortKey="O Brien, Sj" uniqKey="O Brien S">SJ O’Brien</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<affiliations>
<list></list>
<tree>
<noCountry>
<name sortKey="Al Nakeeb, Kosai" sort="Al Nakeeb, Kosai" uniqKey="Al Nakeeb K" first="Kosai" last="Al-Nakeeb">Kosai Al-Nakeeb</name>
<name sortKey="Petersen, Thomas Nordahl" sort="Petersen, Thomas Nordahl" uniqKey="Petersen T" first="Thomas Nordahl" last="Petersen">Thomas Nordahl Petersen</name>
<name sortKey="Sicheritz Ponten, Thomas" sort="Sicheritz Ponten, Thomas" uniqKey="Sicheritz Ponten T" first="Thomas" last="Sicheritz-Pontén">Thomas Sicheritz-Pontén</name>
</noCountry>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000C59 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000C59 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:5699183
   |texte=   Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:29162031" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021